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Cross Reference to Related Applications 

[1] This application is related to U.S. Patent Application No. (Attorney 

10 Docket No. SUNMP233), filed on even date herewith, and entitled "Bandwidth Matching 
Device and Method for Operating the Same," and U.S. Patent Application No. 

(Attorney Docket No. SUNMP234), filed on even date herewith, and entitled 

"Time Slicing Device for Shared Resources and Method for Operating the Same." The 
disclosures of both of these related applications are incorporated herein by reference. 

15 Background of the Invention 

1. Field of the Invention 

[2] The present invention relates generally to a digital memory. 

2. Description of the Related Art 

[3] In modem electronic devices, switches are used to connect a number of senders to 
20 a number of receivers, such that any of the senders can deliver a data stream (or packet) to 
any of the receivers. To accommodate variability in communication between the senders 
and receivers, the switches may use packet buffer memories to store packets received 
from the senders, which cannot immediately be transmitted to the intended receivers. 
[4] One conventional packet buffer memory configuration uses an input buffer 
25 connected to each of a number of input ports, wherein a different input buffer is 
connected to a different one of the number of input ports. Each of the different input 
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buffers associated with the number of input ports is connected to a switching substrate 
(e.g., crossbar switch or shared bus) which is connected to a number of output ports. In 
this configuration, if there are two packets stored in the same input buffer, but destined 
for different output ports, only one packet can make progress even if both output ports are 
5 available. This situation exists because the two packets share a common connectivity into 
the switching substrate. Also in this configuration, the input buffer associated with the 
input port that is receiving the packet must have sufficient space available to receive the 
entire packet. The packet cannot be stored in another input buffer associated with another 
input port. 

10 [5] Another conventional packet buffer memory configuration uses an output buffer 
connected to each of a number of output ports, wherein a different output buffer is 
connected to a different one of the number of output ports. Each of the different output 
buffers associated with the number of output ports are connected to a switching substrate 
(e.g., crossbar switch or shared bus) which is also connected to a number of input ports. 

15 In this configuration, if two packets arrive from two different input ports at the same time 
and are destined for a common output port, only one packet can make progress. This 
situation exists because the two packets must share a connectivity into the common 
output port. Also in this configuration, a packet may be stored in any available output 
buffer, but the entire packet must be stored in the same output buffer. Therefore, the 

20 output buffer must have sufficient space to store the packet before the packet can 
progress. 

[6] In either of the conventional packet buffer memory configurations mentioned 
above, a complicated buffer allocation mechanism is required to achieve efficient 
utilization of the input ports, output ports, and buffer memories. In order to avoid 
25 contention for resources when a new packet arrives, the system must take into account a 
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number of factors, among others, such as (a) availability of space in each buffer, (b) 
whether a packet is currently being deposited into or retrieved from each buffer, and (c) 
the number of packets that are currently waiting for deposition in each buffer. 
[7] In view of the foregoing, there is a need for a more efficient shared buffering 
5 solution that can be implemented without requiring either a complicated buffer allocation 
scheme or high speed memory circuitry. 
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Summary of the Invention 

[8] Broadly speaking, the present invention fills these needs by providing an 
apparatus for efficiently utilizing a shared packet buffer memory in a switch and a method 
for operating the same. More specifically, the apparatus of the present invention includes 
5 a memory having a number of buffers configured to be operated in a ratcheted manner. 
The ratcheted manner in which the memory is operated causes each incoming data stream 
to be distributed across the number of buffers. Since each incoming data stream is stored 
in a distributed manner, a complicated buffer allocation scheme is not required. Each 
stored data stream can also be retrieved from the number of buffers for output from the 
10 memory in a similar ratcheted manner. The memory uses a common rotating selector to 
control the ratcheted manner of operation. Also, the memory is capable of simultaneously 
servicing each of a number of inputs and a number of outputs to which the memory is 
connected. 

[9] In one embodiment, a switch is disclosed. The switch includes a set of inputs and 
15 a set of memory banks. A number of memory banks in the set of memory banks is equal 
to a number of inputs in the set of inputs. Each input is capable of transferring a data 
stream into the set of memory banks. The data stream transferred by each input is 
distributed across each of the set of memory banks. The switch further includes a set of 
outputs having a number outputs equal to the number of memory banks. Each data stream 
20 distributed across each of the set of memory banks is to be output from at least one output 
in the set of outputs. 

[10] In another embodiment, a memory is disclosed. The memory includes a number of 
inputs that are each capable of receiving a data stream to be stored in the memory. The 
memory also includes memory banks equal in number to the inputs. The memory banks 
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are for storing the data streams received by the inputs. A first ratcheting distributor is 
provided for distributing the data stream received by either of the inputs across the 
memory banks. The ratcheting distributor distributes the data stream such that one or 
more of the memory banks contains a distinct portion of the data stream. The memory 
5 further includes a number of outputs that are each capable of providing the stream 
previously stored in the memory. The distinct portion of the data stream contained within 
either of the memory banks is provided to either of the number of outputs by a second 
ratcheting distributor. 

[11] In another embodiment, a ratcheting distributor for distributing a data stream 
10 across a number of memory banks is disclosed. The ratcheting distributor includes a 
number of multiplexers that are each configured to receive a number of inputs and one of 
a number of selector signals. Each of the multiplexers has an output connected to one of 
the memory banks such that one of the multiplexers is connected to each of the memory 
banks. The selector signal received by each multiplexer is capable of controlling which of 
15 the number of inputs will be transmitted to the output. Each of the number of selector 
signals is defined by a different ordering of a number of individual signals. The number 
of individual signals is equal to the number of inputs. The ratcheting distributor also 
includes a rotating selector capable of generating the number of individual signals used to 
define the selector signals. On a given clock cycle, the rotating selector causes one of the 
20 individual signals to be a high signal and each of the remaining individual signals to be a 
low signal. The rotating selector further causes the high signal to be rotated among the 
number of individual signals on successive clock cycles. The ratcheting distributor also 
includes a clock for exercising the rotating selector. 

[12] In another embodiment, a method for operating a memory is disclosed. One aspect 
25 of the method includes receiving a number of inputs that each represent a data stream to 
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be stored in the memory, wherein each of the number of inputs can be received 
simultaneously. The data stream associated with each of the inputs is distributed across a 
number of memory banks. Distribution of the data stream across the memory banks is 
performed such that a portion of the data stream is stored in each of the memory banks as 
5 required to completely store the data stream. Another aspect of the method includes 
retrieving the portion of the data stream stored in each of the memory banks. The portion 
of the data stream retrieved from storage in each of the memory banks is transmitted to an 
output such that the data stream is provided at the output. 

[13] Other aspects of the invention will become more apparent from the following 
10 detailed description, taken in conjunction with the accompanying drawings, illustrating by 
way of example the present invention. 
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Brief Description of the Drawings 



[14] The invention, together with further advantages thereof, may best be understood 
by reference to the following description taken in conjunction with the accompanying 
drawings in which: 

5 Figure 1 is an illustration showing a memory, in accordance with one embodiment 

of the present invention; 

Figure 2 is an illustration showing a rotating selector (i.e., ratchet), in accordance 
with one embodiment of the present invention; 

Figure 3 is an illustration showing a table of selector signals corresponding to 
10 clock cycles 0 through 9, in accordance with the embodiment illustrated in Figures 1 and 
2; 

Figure 4A is an illustration showing the rotating selector at a clock cycle 0, in 
accordance with one embodiment of the present invention; 

Figure 4B is an illustration showing the memory at the clock cycle 0, in 
15 accordance with one embodiment of the present invention; 

Figure 5 A is an illustration showing the rotating selector at a clock cycle 1, in 
accordance with one embodiment of the present invention; 

Figure 5B is an illustration showing the memory at the clock cycle 1, in 
accordance with one embodiment of the present invention; 
20 Figure 6A is an illustration showing the rotating selector at a clock cycle 2, in 

accordance with one embodiment of the present invention; 

Figure 6B is an illustration showing the memory at the clock cycle 2, in 
accordance with one embodiment of the present invention; 
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Figure 7 A is an illustration showing the rotating selector at a clock cycle 3, in 
accordance with one embodiment of the present invention; 

Figure 7B is an illustration showing the memory at the clock cycle 3, in 
accordance with one embodiment of the present invention; 
5 Figure 8A is an illustration showing the rotating selector at a clock cycle 4, in 

accordance with one embodiment of the present invention; 

Figure 8B is an illustration showing the memory at the clock cycle 4, in 
accordance with one embodiment of the present invention; 

Figure 9 A is an illustration showing the rotating selector at a clock cycle 5, in 
10 accordance with one embodiment of the present invention; 

Figure 9B is an illustration showing the memory at the clock cycle 5, in 
accordance with one embodiment of the present invention; 

Figure lOA is an illustration showing the rotating selector at a clock cycle 6, in 
accordance with one embodiment of the present invention; 
15 Figure lOB is an illustration showing the memory at the clock cycle 6, in 

accordance with one embodiment of the present invention; 

Figure 11 A is an illustration showing the rotating selector at a clock cycle 7, in 
accordance with one embodiment of the present invention; 

Figure IIB is an illustration showing the memory at the clock cycle 7, in 
20 accordance with one embodiment of the present invention; 

Figure 12A is an illustration showing the rotating selector at a clock cycle 8, in 
accordance with one embodiment of the present invention; 

Figure 12B is an illustration showing the memory at the clock cycle 8, in 
accordance with one embodiment of the present invention; 
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Figure 13 A is an illustration showing the rotating selector at a clock cycle 9, in 
accordance with one embodiment of the present invention; 

Figure 13B is an illustration showing the memory at the clock cycle 9, in 
accordance with one embodiment of the present invention; and 
5 Figure 14 shows a flowchart illustrating a method for operating a memory, in 

accordance with one embodiment of the present invention. 
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Detailed Description 

[15] Broadly speaking, the present invention provides an apparatus for efficiently 
utilizing a shared packet buffer memory in a switch and a method for operating the same, 
^More specifically, the apparatus of the present invention includes a memory having a 
5 number of buffers configured to be operated in a ratcheted manner. The ratcheted manner 
in which the memory is operated causes each incoming data stream to be distributed 
across the number of buffers. Since each incoming data stream is stored in a distributed 
manner, a complicated buffer allocation scheme is not required. Each stored data stream 
can also be retrieved from the number of buffers for output from the memory in a similar 

10 ratcheted manner. The memory uses a rotating selector to control the ratcheted manner of 
operation. Also, the memory is capable of simultaneously servicing each of a number of 
inputs and a number of outputs to which the memory is connected. It should be 
appreciated that the present invention can be implemented in numerous ways, including 
as a process, an apparatus, a system, a device, or a method. Several exemplary 

15 embodiments of the invention will now be described in detail with reference to the 
accompanying drawings. 

[16] In the following description, numerous specific details are set forth in order to 
provide a thorough understanding of the present invention. It will be apparent, however, 
to one skilled in the art that the present invention may be practiced without some or all of 
20 these specific details. In other instances, well known process operations have not been 
described in detail in order not to unnecessarily obscure the present invention. 
[17] Figure 1 is an illustration showing a memory 100, in accordance with one 
embodiment of the present invention. The memory 100 is capable of accepting four inputs 
Ia. Ib, Ic» and Id and providing four outputs Oa, Ob, Oc, and Od- The number of inputs 
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and outputs associated with the memory 100 are selected for exemplary purposes. In other 
embodiments, the memory 100 can include a different number of inputs and outputs. 
Each of the inputs U, Ib, Ic, and Id are connected to four multiplexers (MUXs) 103A, 
103B, 103C, and 103D. For purposes of illustration, a bus lOlA is used to connect the 
5 four inputs U, Ib, Ic, and Id to each of the MUXs 103A, 103B, 103C, and 103D. In 
actuality, the bus 101 A can be considered to represent direct connections between each of 
the inputs Ia, Ib, Ic, and Id and each of the MUXs 103A, 103B, 103C, and 103D. Each 
MUX 103 A, 103B, 103C, and 103D receives a selector signal Swo, Swi, Sw2, and Sw3, 
respectively. Also, each MUX 103A, 103B, 103C, and 103D has an output Wo, Wi, W2, 

10 and W3, respectively, connected to a buffer (i.e., a memory bank) 105A, 105B, 105C, and 
105D, respectively. In this manner, each output Wo, Wi, W2, and W3 serves as a write 
input to each buffer 105A, 105B, 105C, and 105D, respectively. Each selector signal Swo, 
Swi, Sw2, and Sw3 is used to control which of the inputs Ia, Ib, Ic, and Id will be 
transmitted to the output Wq, Wi, W2, and W3 of each of the MUXs 103 A, 103B, 103C, 

15 and 103D, respectively. Thus, each selector signal Swo, Swi, Sw2, and Sw3 controls which 
of the inputs Ia, Ib^ Ic, and Id will be written through the outputs Wo, Wi, W2, and W3 to 
each of the buffers 105A, 105B, 105C, and 105D, respectively. 

[18] Each buffer 105A, 105B, 105C, and 105D contains a number of data storage 
locations 107 that are each capable of storing a word of data. Each buffer 105 A, 105B, 

20 105C, and 105D is also configured to communicate with an addressing controller 108. 
The addressing controller 108 provides each buffer with information related to which data 
storage location 107 is to be access in a particular input or output operation. Each buffer 
105A, 105B, 105C, and 105D is also dual-ported to have the capability of simultaneously 
receiving input and providing output. In following, each buffer 105A, 105B, 105C, and 

25 105D is connected to a read output Ro, Ri, R2, and R3, respectively. Each read output Ro, 
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Ri, R2, and R3 is connected to four MUXs 103E, 103F 103G, and 103H. For purposes of 
illustration, a bus lOlB is used to connect each read output Rq, Ri, R2, and R3 to each of 
the MUXs 103E, 103F 103G, and 103H. In actuality, the bus lOlB can be considered to 
represent direct connections between each read output Ro, Ri, R2, and R3 and each of the 
5 MUXs 103E, 103F 103G, and 103H. Each MUX 103E, 103F, 103G, and 103H receives a 
selector signal Sro, Sri, Sr2, and Sr3, respectively. Also, each MUX 103E, 103F, 103G, 
and 103H has an output Oa, Ob, Oc, and Od, respectively, corresponding to the outputs of 
the memory 100. Each of the inputs U, Ib, Ic, and Id, each of the write inputs Wq, Wi, W2, 
and W3, each of the read outputs Rq, Ri, R2, and R3, and each of the outputs Oa, Ob, Oc, 

10 and Od have the same bandwidth capability. 

[19] In the embodiment of Figure 1, each of the selector signals Swo, Swi, Sw2, Sw3, 
Sro, Sri, Sr2, and Sr3 consists of four 1-bit signals with one of the 1-bit signals having a 
first digital state and the remainder of the 1-bit signals having a second digital state. In 
one embodiment, the first digital state is a high state and the second digital state is a low 

15 state. In another embodiment, the first digital state is a low state and the second digital 
state is a high state. For purposes of discussion and illustration, the remainder of this 
document will consider the first digital state to be a high state and the second digital state 
to be a low state. The number of 1-bit signals in the selector signal of a particular MUX is 
equal to the number of inputs received by the MUX. Since each of the MUXs 103A-103H 

20 is configured to receive four inputs, each selector signal contains four 1-bit signals. In 
other embodiments, the memory 100 may include a different number of inputs and 
buffers with a correspondingly different number of inputs to each MUX. In these other 
embodiments, the number of 1-bit signals in the selector signal of a particular MUX will 
continue to be equal to the number of inputs received by the MUX, with one 1-bit signal 

25 being high at a given time. 
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[20] The ordering of the 1-bit signals in the selector signals vary between the MUXs. 
Selector signals Swo and Sro have a 1-bit signal ordering of {83,82,81,80} and correspond 
to MUXs 103 A and 103E, respectively. Selector signals 8wi and Sri have a 1-bit signal 
ordering of {80,83,82,81} and correspond to MUXs 103B and 103F, respectively. Selector 
5 signals Sw2 and Sr2 have a 1-bit signal ordering of {81,80,83,82} and correspond to 
MUXs 103C and 103G, respectively. Selector signals Swa and 8r3 have a 1-bit signal 
ordering of {82,81,80,83} and correspond to MUXs 103D and 103H, respectively. Each of 
the selector signals maintains a common sequence of 1-bit signals. However, the common 
sequence of 1-bit signals is successively shifted by a bit position to create each of the 

10 selector signals. Thus, the high 1-bit signal will have a different bit position in each of the 
selector signals at a given time. The high 1-bit signal determines which of the inputs 
received by the MUX will be transmitted to the output of the MUX. 
[21] With respect to MUX 103A, if the 1-bit signal So is high in selector signal Swo, 
input Ia will be transmitted to output Wo- If the 1-bit signal Si is high in selector signal 

15 Swo, input Ib will be transmitted to output Wo. If the 1-bit signal 82 is high in selector 
signal Swo, input Ic will be transmitted to output Wo. If the 1-bit signal S3 is high in 
selector signal Swo, input Id will be transmitted to output Wq. 

[22] With respect to MUX 103B, if the 1-bit signal Si is high in selector signal Swi, 
input Ia will be transmitted to output Wi. If the 1-bit signal 82 is high in selector signal 
20 Swi, input Ib will be transmitted to output Wi. If the 1-bit signal 83 is high in selector 
signal Swi, input Ic will be transmitted to output Wi. If the 1-bit signal So is high in 
selector signal Swi, input Id will be transmitted to output Wi. 

[23] With respect to MUX 103C, if the 1-bit signal 82 is high in selector signal 8w2, 
input Ia will be transmitted to output W2. If the 1-bit signal S3 is high in selector signal 
25 Sw2, input Ib will be transmitted to output W2. If the 1-bit signal 80 is high in selector 
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signal Sw2, input Ic will be transmitted to output W2. If the 1-bit signal Si is high in 
selector signal Sw2, input Id will be transmitted to output W2. 

[24] With respect to MUX 103D, if the 1-bit signal S3 is high in selector signal Sw3, 
input Ia will be transmitted to output W3. If the 1-bit signal So is high in selector signal 
5 Sw3, input Ib will be transmitted to output W3. If the 1-bit signal Si is high in selector 
signal Sw3, input Ic will be transmitted to output W3. If the 1-bit signal S2 is high in 
selector signal Sw3, input Id will be transmitted to output W3. 

[25] With respect to MUX 103E, if the 1-bit signal So is high in selector signal Sro, 
input Ro will be transmitted to output Oa. If the 1-bit signal Si is high in selector signal 
10 Sro, input Ri will be transmitted to output Oa. If the 1-bit signal S2 is high in selector 
signal Sro, input R2 will be transmitted to output Oa. If the 1-bit signal S3 is high in 
selector signal Sro, input R3 will be transmitted to output Oa. 

[26] With respect to MUX 103F, if the 1-bit signal Si is high in selector signal Sri, 
input Ro will be transmitted to output Ob. If the 1-bit signal S2 is high in selector signal 
15 Sri, input Ri will be transmitted to output Ob- If the 1-bit signal S3 is high in selector 
signal Sri, input R2 will be transmitted to output Ob. If the 1-bit signal So is high in 
selector signal Sri, input R3 will be transmitted to output Ob. 

[27] With respect to MUX 103G, if the 1-bit signal S2 is high in selector signal Sr2, 
input Ro will be transmitted to output Oc- If the 1-bit signal S3 is high in selector signal 
20 Sr2, input Ri will be transmitted to output Oc- If the 1-bit signal So is high in selector 
signal Sr2, input R2 will be transmitted to output Oc. If the 1-bit signal Si is high in 
selector signal Sr2, input R3 will be transmitted to output Oc. 

[28] With respect to MUX 103H, if the 1-bit signal S3 is high in selector signal Sr3, 
input Ro will be transmitted to output Od. If the 1-bit signal So is high in selector signal 
25 Sr3, input Ri will be transmitted to output Od- If the 1-bit signal Si is high in selector 

SUNMP232/ASP/KDW 14 PATENT APPLICATION 



signal Sr3, input R2 will be transmitted to output Od. If the 1-bit signal S2 is high in 
selector signal Sr3, input R3 will be transmitted to output Od- 

[29] Figure 2 is an illustration showing a rotating selector (i.e., ratchet) 200, in 
accordance with one embodiment of the present invention. The rotating selector 200 
5 generates the 1-bit signals used to define each of the selector signals. The rotating selector 
200 includes a number of cells 201A-201D corresponding to the number of 1-bit signals 
needed to generate each of the selector signals. Each cell 201A-201D receives a binary 
input (d) and a clock input and generates a binary output (q). The exemplary embodiment 
of Figure 2 corresponds to the memory 100 of Figure 1. Thus, four 1-bit signals So, Si, S2, 

10 and S3 are generated by the cells 201A, 201B, 201C, and 201D, respectively. In one 
embodiment, each of the cells 201A-201D can be an edge-triggered cell such as a flip- 
flop. Upon receipt of a clock signal indicating a clock cycle, each of the cells 201A-201D 
will change the value of its internal state, and therefore its output (q), to take on the 
current value of its input. Thus, each of the cells 201A-201D transmits its state to the next 

15 cell in the sequence at each clock cycle. 

[30] Each of the cells 201 A-201D also includes a reset and set connection. Receipt of a 
high signal on the reset connection will cause the cell to hold a low signal. Receipt of a 
high signal on the set connection will cause the cell to hold a high signal. In the 
embodiment of Figure 2, a reset/initialize signal is connected to the set connection of cell 

20 201 A and to the reset connection of cells 201B-201D. Thus, in the embodiment of Figure 
2, the cells 201A-201D can be initialized or reset such that cell 201A holds a high signal 
and each of cells 201B-201D holds a low signal. After initialization, the first clock cycle 
will cause cell 201B to generate a high signal and each of cells 201 A, 201C, and 201D to 
generate a low signal. Thus, before the first clock cycle, So is a high signal and Si, S2, and 

25 S3 are low signals. In succeeding clock cycles, the high signal will rotate in sequence 

SUNMP232/ASP/KDW 15 PATENT APPLICATION 



from So to Si to S2 to S3 and back to So to start the rotation again. Therefore, one instance 
of the rotating selector 200 circuitry is needed to generate all of the 1-bit signals So, Si, 
S2, and S3 needed to define each of the selector signals Swo, Swi, Sw2, Sw3, Sro, Sri, Sr2, 
and Sr3. Hence, each of the selector signals Swo, Swi, Sw2, Sw3, Sro, Sri, Sr2, and Sr3 can 
5 be defined without using explicit arbitration circuitry. 

[31] Figure 3 is an illustration showing a table of selector signals corresponding to 
clock cycles 0 through 9, in accordance with the embodiment illustrated in Figures 1 and 
2. In clock cycle 0, the 1-bit signal So is high and the remaining 1-bit signals Si, S2, and 
S3 are low. In clock cycle 1, the 1-bit signal Si is high and the remaining 1-bit signals S2, 

10 S3, and So are low. In clock cycle 2, the 1-bit signal S2 is high and the remaining 1-bit 
signals S3, So, and Si are low. In clock cycle 3, the 1-bit signal S3 is high and the 
remaining 1-bit signals So, Si, and S2 are low. In clock cycle 4, the 1-bit signal So is high 
again and the remaining 1-bit signals Si, S2, and S3 are low again. The rotating of the high 
signal among the 1-bit signals continues in the same manner through the remaining clock 

15 cycles. 

[32] The rotation of the high signal among the 1-bit signals at each clock cycle has a 
corresponding effect on the selection signals for each of the MUXs. At a given clock 
cycle, the high signal occupies a different bit position in each of the selector signals 
Swo/Sro, Swi/Sri, Sw2/Sr2, and Sw3/Sr3. In this manner, the selector signals Swo, Swi, 

20 Sw2, and Sw3 control the MUXs 103A, 103B, 103C, and 103D, respectively, such that a 
data stream arriving at a given input will be distributed among the buffers 105A, 105B, 
105C, and 105D as the clock cycles. In a similar manner, the selector signals Sro, Sri, 
Sr2, and Sr3 control the MUXs 103E, 103F, 103G, and 103H, respectively, such that a 
data stream to be provided to a given output will be retrieved from the buffers 105A, 

25 105B, 105C, and 105D as the clock cycles. Thus, the MUXs 103 A, 103B, 103C, and 
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103D, along with the rotating selector 200, serve as a ratcheting distributor for 
distributing the data streams arriving at the various inputs Ia, Ib» Ic> and Id among the 
buffers 105A, 105B, 105C, and 105D. Also, the MUXs 103E, 103F, 103G, and 103H, 
along with the rotating selector 200, serve as a ratcheting distributor for retrieving and 
5 transmitting the distributed data streams from among the buffers 105A, 105B, 105C, and 
105D to the outputs Oa, Ob, Oc, and Od- The operation of the memory 100 can be 
described more completely by way of example. Figures 4A-13B show an example of the 
rotating selector 200 and the memory 100 operation through ten clock cycles (cycle 0 
through cycle 9). 

10 [33] Figure 4A is an illustration showing the rotating selector 200 at a clock cycle 0, in 
accordance with one embodiment of the present invention. Figure 4B is an illustration 
showing the memory 100 at the clock cycle 0, in accordance with one embodiment of the 
present invention. Prior to the clock cycle 0 a number of data streams arrive at inputs Ia, 
Ib, Ic, and Id- A memory controller receives requests from inputs Ia, Ib, Ic, and Id to store 

15 the incoming data streams. The memory controller instructs the memory 100 to begin 
storing data streams incoming at inputs Ia, Ib, Ic, and Id when Swo is {0001}, {0010}, 
{0100}, and { 1000} , respectively. 

[34] At the clock cycle 0, Swo is {0001}. Therefore, at the clock cycle 0, MUX 103 A 
transmits a portion (i.e., a word) of the input Ia data stream through output Wq to buffer 

20 105A. Also, at or before the clock cycle 0 the addressing controller 108 establishes a data 
block across buffers 105A-105D for storage of the input Ia data stream. For illustrative 
purposes, the data block established for the input Ia data stream is identified by bracketed 
regions labeled Ia.O at each buffer 105A-105D. The label Ia-O refers to input Ia and data 
stream 0. In the current example, three data storage locations 107 are allocated within 

25 each buffer 105A-105D for storage of the input Ia data stream. The portion of the input Ia 
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data stream stored in buffer 105A at the clock cycle 0 is indicated by Ia-O.O, wherein the 
input identifier is Ia, the data stream identifier is 0, and the portion identifier is 0. 
[35] In one embodiment, the addressing controller 108 will be provided information 
regarding the size of the incoming data stream to determine the number of data storage 
5 locations needed within each buffer to completely store the data stream. In one 
embodiment, the information regarding the size of the incoming data stream may be 
contained within a header of the data stream. In another embodiment, the information 
regarding the size of the incoming data stream may be discemable from the source of the 
incoming data stream. 

10 [36] Figure 5 A is an illustration showing the rotating selector 200 at a clock cycle 1, in 
accordance with one embodiment of the present invention. Figure 5B is an illustration 
showing the memory 100 at the clock cycle 1, in accordance with one embodiment of the 
present invention. At the clock cycle 1, Swo is {0010}. Therefore, at the clock cycle 1, 
MUX 103 A transmits a portion of the input Ib data stream through output Wo to buffer 

15 105A. Also, at or before the clock cycle 1 the addressing controller 108 establishes a data 
block across buffers 105A-105D for storage of the input Ib data stream. For illustrative 
purposes, the data block established for the input Ib data stream is identified by bracketed 
regions labeled Ib.O at each buffer 105A-105D. The label Ib.O refers to input Ib and data 
stream 0. In the current example, three data storage locations 107 are allocated within 

20 each buffer 105A-105D for storage of the input Ib data stream. The portion of the input Ib 
data stream stored in buffer 105A at the clock cycle 1 is indicated by Ib-O.O, wherein the 
input identifier is Ib, the data stream identifier is 0, and the portion identifier is 0. 
[37] Also, at the clock cycle 1, the selector signal Swi is {0001 } which instructs MUX 
103B to transmit a next portion of the input Ia data stream through output Wi to buffer 

25 105B. The addressing controller 108 directs the buffer 105B to store the next portion of 
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the input Ia data stream at the next available data storage location 107 allocated for Ia.O in 
the buffer 105B. The next portion of the input Ia data stream stored in buffer 105B at the 
clock cycle 1 is indicated by Ia.0.1, wherein the portion identifier is 1. 
[38] Figure 6A is an illustration showing the rotating selector 200 at a clock cycle 2, in 
5 accordance with one embodiment of the present invention. Figure 6B is an illustration 
showing the memory 100 at the clock cycle 2, in accordance with one embodiment of the 
present invention. At the clock cycle 2, Swo is {0100}. Therefore, at the clock cycle 2, 
MUX 103 A transmits a portion of the input Ic data stream through output Wq to buffer 
105A. Also, at or before the clock cycle 2 the addressing controller 108 establishes a data 

10 block across buffers 105A-105D for storage of the input Ic data stream. For illustrative 
purposes, the data block established for the input Ic data stream is identified by bracketed 
regions labeled Ic.O at each buffer 105A-105D. The label Ic.O refers to input Ic and data 
stream 0. In the current example, three data storage locations 107 are allocated within 
each buffer 105A-105D for storage of the input Ic data stream. The portion of the input Ic 

15 data stream stored in buffer 105A at the clock cycle 2 is indicated by Ic-O.O, wherein the 
input identifier is Ic, the data stream identifier is 0, and the portion identifier is 0. 
[39] Also, at the clock cycle 2, the selector signal Swi is {0010} which instructs MUX 
103B to transmit a next portion of the input Ib data stream through output Wi to buffer 
105B. The addressing controller 108 directs the buffer 105B to store the next portion of 

20 the input Ib data stream at the next available data storage location 107 allocated for Ib.O in 
the buffer 105B. The next portion of the input Ib data stream stored in buffer 105B at the 
clock cycle 2 is indicated by Ib.0.1, wherein the portion identifier is 1. 
[40] Also, at the clock cycle 2, the selector signal Sw2 is {0001 } which instructs MUX 
103C to transmit a next portion of the input Ia data stream through output W2 to buffer 

25 105C, The addressing controller 108 directs the buffer 105C to store the next portion of 
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the input Ia data stream at the next available data storage location 107 allocated for Ia.O in 
the buffer 105C. The next portion of the input U data stream stored in buffer 105C at the 
clock cycle 2 is indicated by Ia.0.2, wherein the portion identifier is 2. 
[41] Figure 7A is an illustration showing the rotating selector 200 at a clock cycle 3, in 
5 accordance with one embodiment of the present invention. Figure 7B is an illustration 
showing the memory 100 at the clock cycle 3, in accordance with one embodiment of the 
present invention. At the clock cycle 3, Swo is {1000}. Therefore, at the clock cycle 3, 
MUX 103A transmits a portion of the input Id data stream through output Wq to buffer 
105A. Also, at or before the clock cycle 3 the addressing controller 108 establishes a data 

10 block across buffers 105A-105D for storage of the input Ip data stream. For illustrative 
purposes, the data block established for the input Id data stream is identified by bracketed 
regions labeled Id-O at each buffer 105A-105D. The label Id.O refers to input Id and data 
stream 0. Notice that the data block established for the input Id data stream is not 
immediately adjacent to the data block established for the input Ic data stream. The 

15 addressing controller 108 can establish a data block for an incoming data stream at any 
available location in memory. In one embodiment, the addressing controller 108 is 
operated to establish data blocks for incoming data streams in a manner that will most 
efficiently utilize the available memory. In the current example, three data storage 
locations 107 are allocated within each buffer 105A-105D for storage of the input Id data 

20 stream. The portion of the input Id data stream stored in buffer 105A at the clock cycle 3 
is indicated by Id.0.0, wherein the input identifier is Id, the data stream identifier is 0, and 
the portion identifier is 0. 

[42] Also, at the clock cycle 3, the selector signal Swi is {0100} which instructs MUX 
103B to transmit a next portion of the input Ic data stream through output Wi to buffer 
25 105B. The addressing controller 108 directs the buffer 105B to store the next portion of 

SUNMP232/ASP/KDW 20 PATENT APPLICATION 



the input Ic data stream at the next available data storage location 107 allocated for Ic.O in 
the buffer 105B. The next portion of the input Ic data stream stored in buffer 105B at the 
clock cycle 3 is indicated by Ic.0.1, wherein the portion identifier is 1. 
[43] Also, at the clock cycle 3, the selector signal Sw2 is {0010} which instructs MUX 
5 103C to transmit a next portion of the input Ib data stream through output W2 to buffer 
105C. The addressing controller 108 directs the buffer 105C to store the next portion of 
the input Ib data stream at the next available data storage location 107 allocated for Ib.O in 
the buffer 105C. The next portion of the input Ib data stream stored in buffer 105C at the 
clock cycle 3 is indicated by Ib.0.2, wherein the portion identifier is 2. 

10 [44] Also, at the clock cycle 3, the selector signal Sw3 is {0001 } which instructs MUX 
103D to transmit a next portion of the input Ia data stream through output W3 to buffer 
105D. The addressing controller 108 directs the buffer 105D to store the next portion of 
the input Ia data stream at the next available data storage location 107 allocated for Ia.O in 
the buffer 105D. The next portion of the input Ia data stream stored in buffer 105D at the 

15 clock cycle 3 is indicated by Ia.0.3, wherein the portion identifier is 3. 

[45] Figure 8A is an illustration showing the rotating selector 200 at a clock cycle 4, in 
accordance with one embodiment of the present invention. Figure 8B is an illustration 
showing the memory 100 at the clock cycle 4, in accordance with one embodiment of the 
present invention. At the clock cycle 4, the selector signal Swo is {0001 } which instructs 

20 MUX 103 A to transmit a next portion of the input Ia data stream through output Wo to 
buffer 105A. The addressing controller 108 directs the buffer 105A to store the next 
portion of the input Ia data stream at the next available data storage location 107 allocated 
for Ia.O in the buffer 105A. The next portion of the input Ia data stream stored in buffer 
105 A at the clock cycle 4 is indicated by Ia.0.4, wherein the portion identifier is 4, 
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[46] Also, at the clock cycle 4, the selector signal Swi is { 1000} which instructs MUX 
103B to transmit a next portion of the input Id data stream through output Wi to buffer 
105B. The addressing controller 108 directs the buffer 105B to store the next portion of 
the input Id data stream at the next available data storage location 107 allocated for Id.O in 
5 the buffer 105B. The next portion of the input Id data stream stored in buffer 105B at the 
clock cycle 4 is indicated by Id.0.1, wherein the portion identifier is 1. 
[47] Also, at the clock cycle 4, the selector signal Sw2 is {0100} which instructs MUX 
103C to transmit a next portion of the input Ic data stream through output W2 to buffer 
105C. The addressing controller 108 directs the buffer 105C to store the next portion of 

10 the input Ic data stream at the next available data storage location 107 allocated for Ic.O in 
the buffer 105C. The next portion of the input Ic data stream stored in buffer 105C at the 
clock cycle 4 is indicated by Ic.0.2, wherein the portion identifier is 2. 
[48] Also, at the clock cycle 4, the selector signal Sw3 is {0010} which instructs MUX 
103D to transmit a next portion of the input Ib data stream through output W3 to buffer 

15 105D. The addressing controller 108 directs the buffer 105D to store the next portion of 
the input Ib data stream at the next available data storage location 107 allocated for Ib.O in 
the buffer 105D. The next portion of the input Ib data stream stored in buffer 105D at the 
clock cycle 4 is indicated by Ib.0.3, wherein the portion identifier is 3. 
[49] Figure 9A is an illustration showing the rotating selector 200 at a clock cycle 5, in 

20 accordance with one embodiment of the present invention. Figure 9B is an illustration 
showing the memory 100 at the clock cycle 5, in accordance with one embodiment of the 
present invention. At the clock cycle 5, the selector signal Swo is {0010} which instructs 
MUX 103A to transmit a next portion of the input Ib data stream through output Wq to 
buffer 105A. The addressing controller 108 directs the buffer 105A to store the next 

25 portion of the input Ib data stream at the next available data storage location 107 allocated 
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for Ib.O in the buffer 105A. The next portion of the input Ib data stream stored in buffer 
105A at the clock cycle 5 is indicated by Ib.0.4, wherein the portion identifier is 4. 
[50] Also, at the clock cycle 5, the selector signal Swi is {0001 } which instructs MUX 
103B to transmit a next portion of the input Ia data stream through output Wi to buffer 
5 105B. The addressing controller 108 directs the buffer 105B to store the next portion of 
the input Ia data stream at the next available data storage location 107 allocated for Ia.O in 
the buffer 105B. The next portion of the input U data stream stored in buffer 105B at the 
clock cycle 5 is indicated by Ia.0.5, wherein the portion identifier is 5. 
[51] Also, at the clock cycle 5, the selector signal Sw2 is { 1000} which instructs MUX 

10 103C to transmit a next portion of the input Id data stream through output W2 to buffer 
105C. The addressing controller 108 directs the buffer 105C to store the next portion of 
the input Id data stream at the next available data storage location 107 allocated for Id.O in 
the buffer 105C. The next portion of the input Id data stream stored in buffer 105C at the 
clock cycle 5 is indicated by Id.0.2, wherein the portion identifier is 2. 

15 [52] Also, at the clock cycle 5, the selector signal Sw3 is {0100} which instructs MUX 
103D to transmit a next portion of the input Ic data stream through output W3 to buffer 
105D. The addressing controller 108 directs the buffer 105D to store the next portion of 
the input Ic data stream at the next available data storage location 107 allocated for Ic-O in 
the buffer 105D. The next portion of the input Ic data stream stored in buffer 105D at the 

20 clock cycle 5 is indicated by Ic.0.3, wherein the portion identifier is 3. 

[53] Continuing with the current example, prior to the clock cycle 5, the memory 
controller receives requests to provide the stored data streams Ic-O, Id.O, Ia.O, and Ib.O to 
outputs Oa, Ob, Oc, and Od, respectively. The memory controller instructs the memory 
100 to begin providing the stored data streams to outputs Oa, Ob, Oc, and Od when Srq, 

25 Sri, Sr2, and Sr3, respectively, are {0001}. At the clock cycle 5, Sri is {0001}. 
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Therefore, at the clock cycle 5, MUX 103F transmits the Id.0.0 portion of the input Id 
data stream from the buffer 105A output Ro to output Ob. 

[54] Figure lOA is an illustration showing the rotating selector 200 at a clock cycle 6, 
in accordance with one embodiment of the present invention. Figure lOB is an illustration 
5 showing the memory 100 at the clock cycle 6, in accordance with one embodiment of the 
present invention. At the clock cycle 6, the selector signal Swo is {0100} which instructs 
MUX 103A to transmit a next portion of the input Ic data stream through output Wq to 
buffer 105A. The addressing controller 108 directs the buffer 105A to store the next 
portion of the input Ic data stream at the next available data storage location 107 allocated 

10 for Ic.O in the buffer 105A. The next portion of the input Ic data stream stored in buffer 
105A at the clock cycle 6 is indicated by Ic.0.4, wherein the portion identifier is 4. 
[55] Also, at the clock cycle 6, the selector signal Swi is {0010} which instructs MUX 
103B to transmit a next portion of the input Ib data stream through output Wi to buffer 
105B. The addressing controller 108 directs the buffer 105B to store the next portion of 

15 the input Ib data stream at the next available data storage location 107 allocated for Ib.O in 
the buffer 105B. The next portion of the input Ib data stream stored in buffer 105B at the 
clock cycle 6 is indicated by Ib.0.5, wherein the portion identifier is 5. 
[56] Also, at the clock cycle 6, the selector signal Sw2 is {0001} which instructs MUX 
103C to transmit a next portion of the input Ia data stream through output W2 to buffer 

20 105C. The addressing controller 108 directs the buffer 105C to store the next portion of 
the input Ia data stream at the next available data storage location 107 allocated for Ia.O in 
the buffer 105C. The next portion of the input Ia data stream stored in buffer 105C at the 
clock cycle 6 is indicated by Ia.0.6, wherein the portion identifier is 6. 
[57] Also, at the clock cycle 6, the selector signal Sw3 is { 1000} which instructs MUX 

25 103D to transmit a next portion of the input Id data stream through output W3 to buffer 
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105D. The addressing controller 108 directs the buffer 105D to store the next portion of 
the input Id data stream at the next available data storage location 107 allocated for Id.O in 
the buffer 105D. The next portion of the input Id data stream stored in buffer 105D at the 
clock cycle 6 is indicated by Id.0.3, wherein the portion identifier is 3. 
5 [58] Also, at the clock cycle 6, Sr2 is {0001}. Therefore, at the clock cycle 6, MUX 
103G transmits the Ia.O.O portion of the input Ia data stream from the buffer 105A output 
Ro to output Oc- 

[59] Also, at the clock cycle 6, Sri is {0010}. Therefore, at the clock cycle 6, MUX 
103G transmits the Id-O.I portion of the input Id data stream from the buffer 105B output 
10 Ri to output Ob. 

[60] Figure llA is an illustration showing the rotating selector 200 at a clock cycle 7, 
in accordance with one embodiment of the present invention. Figure 1 IB is an illustration 
showing the memory 100 at the clock cycle 7, in accordance with one embodiment of the 
present invention. At the clock cycle 7, the selector signal Swo is { 1000} which instructs 

15 MUX 103 A to transmit a next portion of the input Id data stream through output Wq to 
buffer 105A. The addressing controller 108 directs the buffer 105A to store the next 
portion of the input Id data stream at the next available data storage location 107 allocated 
for Id.O in the buffer 105A. The next portion of the input Id data stream stored in buffer 
105A at the clock cycle 7 is indicated by Id.0.4, wherein the portion identifier is 4. 

20 [61] Also, at the clock cycle 7, the selector signal Swi is {0100} which instructs MUX 
103B to transmit a next portion of the input Ic data stream through output Wi to buffer 
105B. The addressing controller 108 directs the buffer 105B to store the next portion of 
the input Ic data stream at the next available data storage location 107 allocated for Ic.O in 
the buffer 105B. The next portion of the input Ic data stream stored in buffer 105B at the 

25 clock cycle 7 is indicated by Ic.0.5, wherein the portion identifier is 5. 
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[62] Also, at the clock cycle 7, the selector signal Sw2 is {0010} which instructs MUX 
103C to transmit a next portion of the input Ib data stream through output W2 to buffer 
105C. The addressing controller 108 directs the buffer 105C to store the next portion of 
the input Ir data stream at the next available data storage location 107 allocated for Ib.O in 
5 the buffer 105C. The next portion of the input Ib data stream stored in buffer 105C at the 
clock cycle 7 is indicated by Ib.0.6, wherein the portion identifier is 6. 
[63] Also, at the clock cycle 7, the selector signal Sw3 is {0001 } which instructs MUX 
103D to transmit a next portion of the input Ia data stream through output W3 to buffer 
105D. The addressing controller 108 directs the buffer 105D to store the next portion of 

10 the input Ia data stream at the next available data storage location 107 allocated for U.O in 
the buffer 105D. The next portion of the input Ia data stream stored in buffer 105D at the 
clock cycle 7 is indicated by Ia.0.7, wherein the portion identifier is 7. 
[64] Also, at the clock cycle 7, Sr3 is {0001}. Therefore, at the clock cycle 7, MUX 
103H transmits the Ib.0.0 portion of the input Ib data stream from the buffer 105A output 

15 Ro to output Oq. 

[65] Also, at the clock cycle 7, Sr2 is {0010}. Therefore, at the clock cycle 7, MUX 
103G transmits the Ia-O.I portion of the input U data stream from the buffer 105B output 
Ri to output Oc. 

[66] Also, at the clock cycle 7, Sri is {0100}. Therefore, at the clock cycle 7, MUX 
20 103F transmits the Id.0.2 portion of the input Id data stream from the buffer 105C output 
R2 to output Ob- 

[67] Figure 12A is an illustration showing the rotating selector 200 at a clock cycle 8, 
in accordance with one embodiment of the present invention. Figure 12B is an illustration 
showing the memory 100 at the clock cycle 8, in accordance with one ernbodiment of the 
25 present invention. At the clock cycle 8, the selector signal Swo is {0001} which instructs 
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MUX 103A to transmit a next portion of the input Ia data stream through output Wo to 
buffer 105A. The addressing controller 108 directs the buffer 105A to store the next 
portion of the input Ia data stream at the next available data storage location 107 allocated 
for Ia.O in the buffer 105A. The next portion of the input Ia data stream stored in buffer 
5 105 A at the clock cycle 8 is indicated by Ia.O. 8, wherein the portion identifier is 8. 

[68] Also, at the clock cycle 8, the selector signal Swi is { 1000} which instructs MUX 
103B to transmit a next portion of the input Id data stream through output Wi to buffer 
105B. The addressing controller 108 directs the buffer 105B to store the next portion of 
the input Id data stream at the next available data storage location 107 allocated for Id.O in 

10 the buffer 105B. The next portion of the input Id data stream stored in buffer 105B at the 
clock cycle 8 is indicated by Id.0.5, wherein the portion identifier is 5. 
[69] Also, at the clock cycle 8, the selector signal Sw2 is {0100} which instructs MUX 
103C to transmit a next portion of the input Ic data stream through output W2 to buffer 
105C. The addressing controller 108 directs the buffer 105C to store the next portion of 

15 the input Ic data stream at the next available data storage location 107 allocated for Ic.O in 
the buffer 105C. The next portion of the input Ic data stream stored in buffer 105C at the 
clock cycle 8 is indicated by Ic.0.6, wherein the portion identifier is 6. 
[70] Also, at the clock cycle 8, the selector signal Sw3 is {0010} which instructs MUX 
103D to transmit a next portion of the input Ib data stream through output W3 to buffer 

20 105D. The addressing controller 108 directs the buffer 105D to store the next portion of 
the input Ib data stream at the next available data storage location 107 allocated for Ib.O in 
the buffer 105D. The next portion of the input Ib data stream stored in buffer 105D at the 
clock cycle 8 is indicated by Ib.0.7, wherein the portion identifier is 7. 
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[71] Also, at the clock cycle 8, Sro is {0001}. Therefore, at the clock cycle 8, MUX 
103E transmits the Ic-O.O portion of the input Ic data stream from the buffer 105A output 
Ro to output Oa. 

[72] Also, at the clock cycle 8, Srs is {0010}. Therefore, at the clock cycle 8, MUX 
5 103H transmits the Ib O.I portion of the input Ib data stream from the buffer 105B output 
Ri to output Od. 

[73] Also, at the clock cycle 8, Sr2 is {0100}. Therefore, at the clock cycle 8, MUX 
103G transmits the Ia.0.2 portion of the input Ia data stream from the buffer 105C output 
R2 to output Oc. 

10 [74] Also, at the clock cycle 8, Sri is {1000}. Therefore, at the clock cycle 8, MUX 
103F transmits the Id.0.3 portion of the input Ip data stream from the buffer 105D output 
R3 to output Ob. 

[75] Figure 13A is an illustration showing the rotating selector 200 at a clock cycle 9, 
in accordance with one embodiment of the present invention. Figure 13B is an illustration 

15 showing the memory 100 at the clock cycle 9, in accordance with one embodiment of the 
present invention. At the clock cycle 9, the selector signal Swo is {0010} which instructs 
MUX 103A to transmit a next portion of the input Ib data stream through output Wo to 
buffer 105 A. The addressing controller 108 directs the buffer 105 A to store the next 
portion of the input Ib data stream at the next available data storage location 107 allocated 

20 for Ib.O in the buffer 105A. The next portion of the input Ib data stream stored in buffer 
105A at the clock cycle 9 is indicated by Ib.0.8, wherein the portion identifier is 8. 
[76] Also, at the clock cycle 9, the selector signal Swi is {0001 } which instructs MUX 
103B to transmit a next portion of the input Ia data stream through output W] to buffer 
105B. The addressing controller 108 directs the buffer 105B to store the next portion of 

25 the input Ia data stream at the next available data storage location 107 allocated for Ia-O in 
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the buffer 105B. The next portion of the input Ia data stream stored in buffer 105B at the 
clock cycle 9 is indicated by Ia.0.9, wherein the portion identifier is 9. 
[77] Also, at the clock cycle 9, the selector signal Sw2 is { 1000} which instructs MUX 
103C to transmit a next portion of the input Id data stream through output W2 to buffer 
5 105C. The addressing controller 108 directs the buffer 105C to store the next portion of 
the input Id data stream at the next available data storage location 107 allocated for Id-O in 
the buffer 105C. The next portion of the input Id data stream stored in buffer 105C at the 
clock cycle 9 is indicated by Id.0.6, wherein the portion identifier is 6. 
[78] Also, at the clock cycle 9, the selector signal Sw3 is {0100} which instructs MUX 

10 103D to transmit a next portion of the input Ic data stream through output W3 to buffer 
105D. The addressing controller 108 directs the buffer 105D to store the next portion of 
the input Ic data stream at the next available data storage location 107 allocated for Ic-O in 
the buffer 105D. The next portion of the input Ic data stream stored in buffer 105D at the 
clock cycle 9 is indicated by Ic.0.7, wherein the portion identifier is 7. 

15 [79] Also, at the clock cycle 9, Sro is {0010}. Therefore, at the clock cycle 9, MUX 
103E transmits the Ic-O.l portion of the input Ic data stream from the buffer 105B output 
Ri to output Oa. 

[80] Also, at the clock cycle 9, Sr3 is {0100}. Therefore, at the clock cycle 9, MUX 
103H transmits the Ib.0.2 portion of the input Ib data stream from the buffer 105C output 
20 R2 to output Od. 

[81] Also, at the clock cycle 9, Sr2 is {1000}. Therefore, at the clock cycle 9, MUX 
103G transmits the Ia.0.3 portion of the input Ia data stream from the buffer 105D output 
R3 to output Oc. 
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[82] Also, at the clock cycle 9, Sri is {0001}. Therefore, at the clock cycle 9, MUX 
103F transmits the Id.0.4 portion of the input Id data stream from the buffer 105A output 
Ro to output Ob. 

[83] In this example Sro-Sr3 are composed of the same signals that make up Swo-Sws- 
5 However, they may also be generated by a distinct rotating selector. 

[84] As demonstrated by the example memory 100 and rotating selector 200 operation 
shown in Figures 4A-13B, when a data stream is deposited in the memory 100, the data 
stream is interleaved over a number of memory banks included in the memory 100. Each 
of the number of memory banks is independently addressable. In one embodiment, each 

10 of the number of memory banks is also dual ported to be written to and read from 
simultaneously. However, in other embodiments, memory banks that are not dual ported 
can also be used. The memory 100 is capable of simultaneously receiving a number of 
incoming data stream transmissions and providing a number of outgoing data stream 
transmissions. Each of the incoming and outgoing data stream transmissions can be 

15 independently started and stopped by a memory controller. In one embodiment, once an 
incoming or outgoing data stream transmission is started, the data stream transmission 
tends to continue for a while before an idle period is encountered. The tendency of the 
data stream transmission to continue is not a necessary condition of the present invention, 
but is a characteristic of switch traffic that can be exploited by the present invention for 

20 better performance. 

[85] As previously discussed, the memory controller instructs the memory 100 to begin 
storing data streams incoming at inputs Ia, Ib, Ic, and Id when Swo is {0001}, {0010}, 
{0100}, and {1000}, respectively. If storage of the incoming data streams cannot be 
started or stopped immediately (i.e., if a latency exists from the time a start or stop 

25 instruction is issued by the memory controller to the time the incoming data stream 

SUNMP232/ASP/KDW 30 PATENT APPLICATION 



actually starts or stops), an elasticity buffer can be added before each of the inputs Ia, Ib, 
Ic, and Id. The elasticity buffer can be used to absorb data that continues to arrive after a 
stop instruction is issued by the memory controller, and to buffer up data that arrives prior 
to execution of a start instruction issued by the memory controller. In the example 
5 memory 100, each of the inputs U, Ib, Ic, and Id will have to wait at most four clock 
cycles before execution of a start instruction issued by the memory controller. Therefore, 
the elasticity buffer should be large enough to handle an amount of data that can arrive in 
an incoming data stream over four clock cycles. 

[86] Since each memory bank is assigned to a different input and output in every clock 

10 cycle, the addressing controller 108 changes the write and read addresses for each 
memory bank in every clock cycle to point to the appropriate data storage location. In one 
embodiment, management of the addresses for each memory bank by the addressing 
controller 108 can be performed using a time slicing device similar to that described in 
the patent application entitled "Time Slicing Device for Shared Resources and Method for 

15 Operating the Same," as previously incorporated herein by reference. 

[87] The memory 100 as previously described assumes that all of the inputs and all of 
the outputs have the same bandwidth characteristics. In applications where there exist 
inputs of differing bandwidth characteristics, a bandwidth matching device similar to that 
described in the patent application entitled "Bandwidth Matching Device and Method for 

20 Operating the Same," as previously incorporated herein by reference, can be used. The 
bandwidth matching device is capable of transforming the input bandwidths to satisfy a 
common bandwidth requirement before being sent to the ratcheting distributor. For 
example, if there are four Ix bandwidth inputs and three 4x bandwidth inputs, the four Ix 
inputs can be merged into a single 4x stream using the bandwidth matching device. Then, 

25 the resulting four 4x streams can be sent to the ratcheting distributor. Alternatively, the 
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three 4x bandwidth inputs can be transformed into twelve Ix streams. Then, the resulting 
sixteen Ix streams can be sent to the ratcheting distributor. 

[88] Figure 14 shows a flowchart illustrating a method for operating a memory, in 
accordance with one embodiment of the present invention. The method includes an 
5 operation 1401 in which the memory receives a number of inputs that each represent a 
data stream to be stored in the memory. The memory is operated with a capability to 
receive each of the number of inputs simultaneously. The method also includes an 
operation 1403 in which the data stream associated with each of the number of inputs is 
distributed across a number of memory banks. A portion of the data stream is stored in 

10 each of the number of memory banks as required to completely store the data stream. 

[89] A ratcheting distributor can be used to distribute the data stream associated with 
each of the number of inputs across the number of memory banks. Use of the ratcheting 
distributor includes operation of a number of multiplexers. When operated, each of the 
number of multiplexers receive each of the number of inputs and a selector signal. The 

15 selector signal controls which of the number of inputs is transmitted to a multiplexer 
output that is connected to one of the number of memory banks. Use of the ratcheting 
distributor further includes generating the selector signal received by each of the number 
of multiplexers. The generated selector signal is different for each of the number of 
multiplexers on each cycle of a clock. However, the number of different selector signals 

20 is equal to the number of multiplexers. The selector signal received by each of the number 
of multiplexers is repeated on a clock cycle that is a multiple of the number of inputs. 
[90] The method further includes an operation 1405 in which the portion of the data 
stream stored in each of the number of memory banks is retrieved. In an operation 1407, 
the retrieved portions of the data stream are transmitted to an output. Transmission of the 

25 retrieved portions of the data stream to the output cause the data stream to be provided at 
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the output. The memory is operated with a capability to simultaneously retrieve and 
transmit portions of different data streams stored in each of the number of memory banks, 
such that each of the number of data streams are transmitted to different outputs. 
[91] A second ratcheting distributor can be used to retrieve and transmit the portion of 
5 the data stream stored in each of the number of memory banks. Use of the second 
ratcheting distributor includes operation of a number of multiplexers. When operated, 
each of the number of multiplexers receive the portion of the data stream stored in each of 
the number of memory banks and a selector signal. The selector signal controls which 
portion of the data stream stored in each of the number of memory banks is transmitted to 

10 the output. Use of the ratcheting distributor further includes generating the selector signal 
received by each of the number of multiplexers. The generated selector signal is different 
for each of the number of multiplexers on each cycle of a clock. However, the number of 
different selector signals is equal to the number of multiplexers. The selector signal 
received by each of the number of multiplexers is repeated on a clock cycle that is a 

15 multiple of the number of memory banks. 

[92] While this invention has been described in terms of several embodiments, it will 
be appreciated that those skilled in the art upon reading the preceding specifications and 
studying the drawings will realize various alterations, additions, permutations and 
equivalents thereof. It is therefore intended that the present invention includes all such 

20 alterations, additions, permutations, and equivalents as fall within the true spirit and 
scope of the invention. 

What is claimed is: 
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